US Patent & Trademark Office
1 Replication for ...

Swaminathan Sivasubramanian, Michal Szymaniak, Guillaume Pierre, Maarten van Steen 
September 2004 ACM Computing Surveys (CSUR), volume 36 issue 3 

Full text available: pdf(374.99 KB) Additional Information: full citation, abstract, references, index terms

Replication is a well-known technique to improve the accessibility of Web sites. It generally 
offers reduced client latencies and increases a site's availability. However, applying 
replication techniques is not trivial, and various Content Delivery Networks (CDNs) have 
been created to facilitate replication for digital content providers. The success of these CDNs 
has triggered further research efforts into developing advanced Web replica hosting
systems. These are systems that ...



Keywords: Web replication, content delivery networks 



2 Streaming Media 
Frank LaMonica 

January 2001 Linux Journal 

Full text available. Additional Information: full citation, abstract, references, index terms

LaMonica describes the hardware and software technology used on the server side of the 
streaming process. 

3 Applications ...

Balachander Krishnamurthy, Richard Liston, Michael Rabinovich 

May 2003 Proceedings of the twelfth international conference on World Wide Web 

Full text available: pdf(331.13 KB) Additional Information: full citation, abstract, references, index terms



With a key component of latency on the Web being connection set up between clients and 
Web servers, several ways to avoid connections have been explored. While the work in 
recent years on Content Distribution Networks (CDNs) have moved some content 'closer' to 
users at the cost of increasing DNS traffic, they have not fully exploited the available unused 
potential of existing protocols. We explore ways by which a variety of Web responses can be 
piggybacked on DNS messages. While we evaluated our ... 

4 Traffic ch...

Kunwadee Sripanidkulchai, Bruce Maggs, Hui Zhang 
October 2004 Proceedings of the 4th ACM SIGCOMM conference on Internet
measurement 

Full text available: pdf Additional Information: full citation, abstract, references, index terms

In this paper, we study the live streaming workload from a large content delivery network. 
Our data, collected over a 3 month period, contains over 70 million requests for 5,000 
distinct URLs from clients in over 200 countries. To our knowledge, this is the most 
extensive data of live streaming on the Internet that has been studied to date. Our 
contributions are two-fold. First, we present a macroscopic analysis of the workload, 
characterizing popularity, arrival process, session duration, an ... 

Keywords: content delivery networks, live streaming 



5 Applications ... point

G. Manjunath, T. Simunic, V. Krishnan, J. Tourrilhes, D. Das, V. Srinivasmurthy, A. McReynolds 
October 2004 Proceedings of the 2nd ACM international workshop on Wireless mobile 
applications and services on WLAN hotspots 

Full text available: pdf Additional Information: full citation, abstract, references, index terms

Wireless access at cafes, airports, homes and businesses have proliferated all over the globe 
with several different Wireless Internet Service Providers. Similarly, digital media has 
created a paradigm shift in media processing resulting in a complete change in media usage 
models, revamped existing businesses and has introduced new industry players. We believe 
there is a tremendous opportunity for application and system services at the intersection of 
the above two domains for exploiting the ... 

Keywords: access point, low-power, management, media, security, wireless 



6 Dynamic parallel ...

Pablo Rodriguez, Ernst W. Biersack 

August 2002 IEEE/ACM Transactions on Networking (TON), volume 10 issue 4 

Full text available: pdf Additional Information: full citation, abstract, references, citings, index terms



Popular content is frequently replicated in multiple servers or caches in the Internet to 
offload origin servers and improve end-user experience. However, choosing the best server 
is a nontrivial task and a bad choice may provide poor end user experience. In contrast to 
retrieving a file from a single server, we propose a parallel-access scheme where end users 
access multiple servers at the same time, fetching different portions of that file from 
different servers and reassembling them locally. ... 

Keywords: HTTP, content distribution, internet, mirroring, parallel access, peer-to-peer, 
replication, web 



7 Frangipani ...

Chandramohan A. Thekkath, Timothy Mann, Edward K. Lee 

October 1997 ACM SIGOPS Operating Systems Review, Proceedings of the sixteenth
ACM symposium on Operating systems principles, volume 31 issue 5

Full text available: pdf(2.20 MB) Additional Information: full citation, references, citings, index terms



8 High performance data broadcasting systems
Peter Triantafillou, R. Harpantidou, M. Paterakis

August 2002 Mobile Networks and Applications, volume 7 issue 4 

Full text available: pdf Additional Information: full citation, abstract, references, index terms

Data broadcasting as a means of efficient data dissemination is a key technology facilitating 
ubiquitous computing. For this reason, broadcast scheduling algorithms have received a lot 
of attention. However, all existing algorithms make the core assumption that the data items 
to be broadcast are immediately available in the transmitter's queue, ignoring the key role 
that the disk subsystem and the cache management play in the overall broadcast system 
performance. With this paper we contribute a ... 

Keywords: broadcast scheduling, cache management, data dissemination, disk scheduling, 
system performance 



9 Putting it together: Building big, reliable web sites, (part 1)
Win Treese

March 2002 netWorker, volume 6 issue 1



In the early days of the Web, most users were not terribly concerned with performance or 
reliability. Sometimes Web sites were slow, sometimes they did not respond, but that was to 
be expected. Besides, many users had slow connections to the network, so they hardly 
noticed if the server was slow. A popular server might be overloaded for a while, but then it 
would recover. 

10 The travails ...
Carole Goble, Simon Harper, Robert Stevens 

May 2000 Proceedings of the eleventh ACM on Hypertext and hypermedia 

Full text available: pdf(229.51 KB) Additional Information: full citation, references, citings, index terms



Keywords: hypertext, mobility, navigation, travel, usability, visual impairment, web 



11 Low power SOCs and NOCs: Disk drive energy optimization for audio-video
applications

Ravishankar Rao, Sarma Vrudhula, Musaravakkam S. Krishnan 

September 2004 Proceedings of the 2004 international conference on Compilers, 
architecture, and synthesis for embedded systems 

Full text available: pdf Additional Information: full citation, abstract, references, index terms

Earlier techniques for low power speed control in disk drives running audio/video 
applications attempted to either match the drive's speed to the data rate requirement of the 
host application (just-in-time speed), or run it at the maximum drive speed, neither of which 
are energy-optimal in general. Starting from the theory of DC motors, we obtain a high-level 
power model of a disk drive. We then analytically obtain the speed profile (function of time) 
that minimizes the energy required to transf ... 

Keywords: disk drive, low power, multimedia, speed control 




Additional Information: full citation, abstract, index terms



12 Network behavior: An analysis of Internet content delivery systems 

Stefan Saroiu, Krishna P. Gummadi, Richard J. Dunn, Steven D. Gribble, Henry M. Levy
December 2002 ACM SIGOPS Operating Systems Review, volume 36 issue SI
Full text available: pdf(2.07 MB) Additional Information: full citation, abstract, references

In the span of only a few years, the Internet has experienced an astronomical increase in 
the use of specialized content delivery systems, such as content delivery networks and peer- 
to-peer file sharing systems. Therefore, an understanding of content delivery on the Internet 
now requires a detailed understanding of how these systems are used in practice. This paper
examines content delivery from the point of view of four content delivery systems: HTTP 
web traffic, the Akamai content delivery netw ... 

13 Web and e-business application: User adaptive content delivery mechanism on the
world wide web

Tadashi Nakano, Kaname Harumoto, Shinji Shimojo, Shojiro Nishio 

March 2002 Proceedings of the 2002 ACM symposium on Applied computing 

Full text available: pdf Additional Information: full citation, abstract, references, index terms

To reduce the user-perceived latency in web content delivery, many techniques have been 
proposed. One is a transmission time control mechanism that automatically adjusts the 
quality of inline objects, such as images on a web page, according to the client network 
bandwidth. Another is a transmission order control mechanism that can transmit inline 
objects in a specified order preferred by users. In this paper, we describe the development 
of a user adaptive content delivery mechanism that integrat ... 

Keywords: HTTP extension, WWW, content adaptation, content delivery, quality of service, 
transmission order control, transmission time control, user profile 



14 On-line extraction of SCSI disk drive parameters 

Bruce L. Worthington, Gregory R. Ganger, Yale N. Patt, John Wilkes 

May 1995 ACM SIGMETRICS Performance Evaluation Review, Proceedings of the 1995
ACM SIGMETRICS joint international conference on Measurement and
modeling of computer systems, volume 23 issue 1

Full text available: pdf(1.21 MB) Additional Information: full citation, abstract, references, citings, index terms

Sophisticated disk scheduling algorithms require accurate, detailed disk drive specifications, 
including data about mechanical delays, on-board caching and prefetching algorithms, 
command and protocol overheads, and logical-to-physical block mappings. Comprehensive 
disk models used in storage subsystem design require similar levels of detail. We describe a 
suite of general-purpose algorithms and techniques for acquiring the necessary information 
from a SCSI disk drive. Using only the ANSI-standa ... 

15 ... reordering

Elizabeth Shriver, Arif Merchant, John Wilkes 

June 1998 ACM SIGMETRICS Performance Evaluation Review, Proceedings of the
1998 ACM SIGMETRICS joint international conference on Measurement and
modeling of computer systems, volume 26 issue 1

Full text available: pdf Additional Information: full citation, abstract, references, citings, index terms

Modern disk drives read-ahead data and reorder incoming requests in a workload-dependent 
fashion. This improves their performance, but makes simple analytical models of them 
inadequate for performance prediction, capacity planning, workload balancing, and so on. To 
address this problem we have developed a new analytic model for disk drives that do 
readahead and request reordering. We did so by developing performance models of the disk 
drive components (queues, caches, and the disk mechanism) and ... 

Overlay networks have emerged as a powerful and highly flexible method for delivering
content. We study how to optimize throughput of large transfers across richly connected, 
adaptive overlay networks, focusing on the potential of collaborative transfers between 
peers to supplement ongoing downloads. First, we make the case for an erasure-resilient 
encoding of the content. Using the digital fountain encoding approach, end hosts can 
efficiently reconstruct the original content of size n ...

Keywords: bloom filter, content delivery, digital fountain, erasure code, min-wise sketch, 

This paper presents a systematic study of the properties of a large number of Web sites
hosted by a major ISP. To our knowledge, ours is the first comprehensive study of a large 
server farm that contains thousands of commercial Web sites. We also perform a simulation 
analysis to estimate potential performance benefits of content delivery networks (CDNs) for 
these Web sites. We make several interesting observations about the current usage of Web 
technologies and Web site performance characteristi ... 

Disk subsystem performance can be dramatically improved by dynamically ordering, or
scheduling, pending requests. Via strongly validated simulation, we examine the impact of 
complex logical-to-physical mappings and large prefetching caches on scheduling 
effectiveness. Using both synthetic workloads and traces captured from six different user 
environments, we arrive at three main conclusions: (1) Incorporating complex mapping 
information into the scheduler provides only a ma ... 

Overlay networks have emerged as a powerful and highly flexible method for delivering
content. We study how to optimize throughput of large transfers across richly connected, 
adaptive overlay networks, focusing on the potential of collaborative transfers between 
peers to supplement ongoing downloads. First, we make the case for an erasure-resilient 
encoding of the content. Using the digital fountain encoding approach, end-hosts can 
efficiently reconstruct the original content of size n from a ...

This paper analyzes various algorithms for scheduling low priority disk drive tasks. The
derived closed form solution is applicable to class of greedy algorithms that include a variety 
of background disk scanning applications. By paying close attention to many characteristics 
of modern disk drives, the analytical solutions achieve very high accuracy— the difference 
between the predicted response times and the measurements on two different disks is only 
3% for all but one examined workload. This ... 

DIXtrac is a program that automatically characterizes the performance of modern disk
drives. This extended abstract overviews the contents of [3], which describes and validates 
DIXtrac's algorithms for extracting accurate values for over 100 performance-critical 
parameters in 2-6 minutes without human intervention or special hardware support. The 
extracted data includes detailed layout and geometry information, mechanical timings, 
cache management policies, and command processing overheads. ... 

Fragment-based caching has been proposed as a promising technique for dynamic Web
content delivery and caching. Most of these approaches either assume the fragment-based 
content is served by Web server automatically, or look at server-side caching only. There is 
no method of extracting fragments from an existing dynamic Web content, which is of great 
importance to the success of fragment-based caching. Also, current technologies for
supporting dynamic fragments do not allow to take into account c ... 

CDN (Content Delivery Networks) improves end-user performance by replicating web
contents on a group of geographically distributed content servers. Replication Algorithm 
plays an important role in helping users to retrieve Web objects from the content servers. If
a user can directly get the requested objects from the content server, he need not to 
contact the remote origin server and the user delay can be reduced. However, current 
replica strategies in CDN are to simply and repeatedly keep the c ... 